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BACKGROUND OF THE INVENTION 

Field of Invention 

The present invention pertains to the field of 
5 designing graph structures. More particularly, this 
invention relates to deriving a genome representation 
for evolving the weights in a graph structure. 

Art Background 

10 A variety of disciplines including computer 

science commonly express solutions to problems in the 
form of graph structures. For example, neural 
networks which are commonly used in computer-related 
applications may be expressed in the form of graph 

15 structures. 

A graph structure typically includes a set of 
nodes and a set of arcs that provide interconnections 
among the nodes. Each arc of a graph structure 
20 usually has an associated weight. The design of a 
graph structure typically involves determining an 
appropriate arrangement of nodes and arcs and 
determining an appropriate weight for each arc. 

25 One prior method for determining the weights in 

a graph structure is to use genetic programming 
techniques to evolve the weights. A typical genetic 
programming method involves generating an initial 
population of organisms each of which is a candidate 

30 solution for the weights, selecting a subset of 
organisms from the initial population for use as 
parents of a generation of child organisms, and 
generating the child organisms by combining genetic 
material from the parent organisms using genetic 
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operators such as mutation and cross-over. 
Typically, many generations of child organisms are 
generated and tested before a suitable set of weights 
is found. 

5 

The genetic operators of mutation and crossover 
are usually applied to an arrangement of genetic 
material which is commonly referred to as a genome 
representation for the weights. It is usually 

10 desirable to employ a genome representation that will 
yield the most efficient evolution to a desired 
solution. For example, a reduction in the number of 
generations of organisms that are generated and 
evaluated usually decreases the time it takes to 

15 reach a desired solution and decreases the overall 
design cost of a graph structure. 
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SUMMARY OF THE INVENTION 



A method is disclosed for deriving a genome 
representation for the weights in a graph structure 
that increases the likelihood that solutions to 
substructures of the graph structure will be 
preserved during crossover operations. The 
preservation of solutions to substructures across 
generations of organisms may decrease the time and 
costs associated with evolving a suitable set of 
weights . 

A method for designing a graph structure 
according to the present teachings includes 
determining a genome representation for a set of 
weights for a set of arcs in the graph structure such 
that the arcs of the graph structure that participate 
in a substructure of the graph structure are in a 
close proximity in the genome representation. The 
graph structure may then be evolved using the genome 
representation . 

Other features and advantages of the present 
invention will be apparent from the detailed 
description that follows. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



The present invention is described with respect 
to particular exemplary embodiments thereof and 
5 reference is accordingly made to the drawings in 
which : 



Figure 1 shows a method for designing a graph 
structure which incorporates the present teachings; 

Figures 2a-2b illustrate crossover operations 
which may be used to generate a child organism from , 
pair of parent organisms when evolving the graph 
structure ; 

Figure 3 shows one example of a graph structure 
which includes an arrangement of nodes and 
interconnecting arcs with associated weights. 
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DETAILED DESCRIPTION 



Figure 1 shows a method for designing a graph 
structure which incorporates the present teachings. 
5 At step 100, an arrangement of nodes and arcs for the 
graph structure is determined. Any one or more of a 
variety of known technigues may be employed at step 
100 including designs by hand and automated methods. 

10 At step 102, a genome representation for the 

weights of the graph structure from step 100 is 
determined. The genome representation is determined 
at step 102 such that the arcs of the graph structure 
that participate in a substructure of the graph 

15 structure are in a close proximity in the genome 
representation. This arrangement for the genome 
representation increases the likelihood that the 
weights associated with a substructure will be 
preserved during the formation of a next generation 

20 of organisms using crossover. 

At step 104, the graph structure is evolved 
using the genome representation obtained at step 102. 
Step 104 may be performed using any one or more of a 
25 variety of known genetic programming techniques. 

Figures 2a-2b illustrate crossover operations 
which may be used at step 104 to generate a child 
organism from a pair of parent organisms when 
30 evolving the graph structure 200. In this example, 
the parent and child organism each have genetic 
material made up of a sequence of bits. 
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Alternatively, the genetic material of an organism 
may be a sequence of numbers. 



Figure 2a shows a one-point crossover operation 
5 20 which combines a parent organism 30 with a parent 
organism 32 to yield a child organism 34. The one- 
point crossover operation 20 combines a sequence of 
genetic material 10010 from the parent organism 30 as 
a prefix with a sequence of genetic material 00100 

10 from the parent organism 32 as a suffix to yield a 

sequence of genetic material 1001000100 in the child 
organism 34. The crossover point in this example is 
between the fifth locus and sixth locus of the 
sequence but in general may be located anywhere in 

15 the sequence. The locus of crossover may be randomly 
chosen each time. 

Figure 2b shows a two-point crossover operation 
22 which combines a sequence of genetic material 100- 

20 10 from the parent organism 30 with a sequence of 

genetic material 11001 from the parent organism 32 to 
yield a sequence of genetic material 1001100110 in 
the child organism 34. The crossover points in this 
example are between the third locus and fourth locus 

25 of the sequence and between the eighth locus and 
ninth locus of the sequence but in general may be 
located at any two positions in the sequence. 

Figure 3 shows one example of a graph structure 
30 200 which includes an arrangement of nodes 10-17 and 
interconnecting arcs which are referred to by their 
associated weights W1-W10. In one embodiment, the 
graph structure 200 represents a neural network. 
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The example graph structure 200 includes a 
substructure of the arcs Wl, W2 , W5, and W6 and a 
substructure of the arcs Wl, W2, W9, and W10 and a 
substructure of the arcs W3, W4, W7 and W8 . A prior 
art breadth-first ordering of the weights W1-W10 of 
the graph structure 200 would result in the following 
sequence of genetic material in a genome 
representation . 



W5 W6 W7 W8 W9 W10 Wl W2 W3 W4 



This genome representation yielded by prior art 
techniques would break up the sequence of genetic 
material for the substructure of the arcs Wl, W2, W5, 
and W6 when cross-over is applied between the first 
locus and the eighth locus. Similarly, the above 
genome representation yielded by prior art techniques 
would break up the sequence of genetic material for 
the substructure of the arcs W3 , W4, W7 and W8 when 
cross-over is applied between the third locus and the 
tenth locus. 

The breakup of this genetic material prevents 
possibly fit solutions for the substructure of the 
arcs Wl, W2, W5, and W6 and for the substructure of 
the arcs W3, W4, W7 and W8 from being passed on to a 
next generation of organisms during evolution. Such 
a failure to pass on fit solutions can increase the 
time and cost associated with evolving the graph 
structure 200. 

According to the present techniques, a genome 
representation is determined at step 102 by 
multiplying a connection matrix element-by-element 
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with a weight matrix to yield a product matrix. At 
step 102, a score is generated by summing the 
elements of the product matrix. A minimum value for 
the score is determined at step 102 by exchanging and 
5 the rows and columns of the connection matrix while 
re-computing the score until a minimum value is 
obtained. The connection matrix that yields the 
minimum score provides the genome representation to 
be used at step 104. 

10 

The connection matrix is generated such that 
each row represents a node and each column represents 
an arc, initially in some order. The value of an 
element in the connection matrix is one if the node 
15 represented by the row and the column represented by 
the arc are connected and is zero otherwise. 



The weight matrix is of the same size as the 
connection matrix. Each element (i , j ) of the weight 
20 is given by the following equation: 

element (i, j) =\D*i-N*j\ 

where D is the total number of columns, i.e. arcs, 
and N is the total number of rows, i.e. nodes. The 
value of an element of the weight matrix indicates an 
amount by which the element is "off diagonal." For 
25 example, if the number of rows is four and the number 
of columns is four, the weight matrix is as follows. 



30 



0 


4 


8 


12 


4 


0 


4 


8 


8 


4 


0 


4 


12 


8 


4 


0 
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Each cell of the weight matrix directly encodes 
the distance off the diagonal multiplied by the 
number of rows which is an immaterial constant 
factor. If the number of rows is three and the 
5 number of columns is five, the weight matrix is as 
follows . 



0 


3 


6 


9 


12 


2 


2 


1 


4 


7 


10 


7 


4 , 


1 


5 



10 

The elements of the product matrix when summed 
equal a score which is lower when arcs that share 
nodes are closer together according to the linear 
ordering given by the row indices. The product 

15 matrix is iteratively generated by exchanging rows 
with other rows and columns with other columns so 
long as the score continues to diminish. In one 
embodiment, the score is determined for all exchanges 
of the first row with the other rows. This yields a 

20 locally optimal position for the weight referenced by 
the first row. The score is then determined for all 
exchanges of the second row with the other rows to 
yield a locally optimal position for the weight 
referenced by the second row, and so on. Thereafter, 

25 the score is determined for all exchanges of the 
first column with the other columns, then for all 
exchanges of the second column with the other 
columns, and so on. The product matrix having the 
lowest score yields an optimal arrangement of weights 

30 according to a metric which indicates proximity or 
compactness of weights associated with the same 
substructure . 
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For the example graph structure 200 having 8 
nodes and 10 arcs, the weight matrix is a follows. 



0 


8 


16 


24 


32 


40 


48 


56 


64 


72 


10 


2 


6 


14 


22 


30 


38 


46 


54 


62 


20 


12 


4 


4 


12 


20 


28 


36 


44 


52 


30 


22 


14 


6 


2 


10 


18 


26 


34 


42 


40 


32 


24 


16 


8 


0 


8 


16 


24 


32 


50 


42 


34 


26 


18 


10 


2 


6 


14 


22 


60 


52 


44 


36 


28 


20 


12 


4 


4 


12 


70 


62 


54 


46 


38 


30 


22 


14 


6 


2 



For the breadth-first ordering of the weights 
W1-W10 given by the seguence W5 W6 W7 W8 W9 W10 Wl W2 
W3 W4, the connection matrix is as follows where N10- 
15 N17 refer to nodes 10-17, respectively. 





W5 


W6 


W7 


W8 


W9 


W10 


Wl 


W2 


W3 


W4 


N10 


1 


1 


1 


1 


0 


0 


0 


0 


0 


0 


Nil 


0 


0 


0 


0 


1 


1 


0 


0 


0 


0 


N12 


1 


0 


0 


0 


1 


0 


1 


0 


0 


0 


N13 


0 


1 


0 


0 


0 


1 


0 


1 


0 


0 


N14 


0 


0 


1 


0 


0 


0 


0 


0 


1 


0 


N15 


0 


0 


0 


1 


0 


0 


0 


0 


0 


1 


N16 


0 


0 


0 


0 


0 


0 


1 


1 


0 


0 


N17 


0 


0 


0 


0 


0 


0 


0 


0 


1 


1 



25 
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This yields the following product matrix. 





W5 


W6 


W7 


W8 


W9 


W10 


Wl 


W2 


W3 


W4 


N10 


0 


8 


1 6 
















Nil 


0 


0 


0 


0 


22 


3 0 


0 


0 


0 




N12 


20 


0 


0 


0 














N13 


0 


22 


0 


0 


0 


10 


0 


26 


0 


0 


N14 


0 


0 


24 


0 


0 


0 


0 


0 


24 


0 


N15 


0 


0 


0 


26 


0 


0 


0 


0 


0 


22 


N16 


0 


0 


0 


0 


0 


0 


12 


4 


0 


0 


N17 


0 


0 


0 


0 


0 


0 


0 


0 


6 


2 



The score for this product matrix is 338. 
15 The product matrix having the lowest score in 



this example is as follows. 





W5 


Wl 


W2 


W6 


W7 


W3 


W8 


W4 


W9 


W10 


N12 


0 


8 


0 


0 


0 


0 


0 


0 


64 


0 


N16 


0 


2 


6 


0 


0 


0 


0 


0 


0 


0 


N13 


0 


0 


4 


0 


12 


0 


0 


0 


0 


52 


N10 


30 


0 


0 


6 


2 


0 


18 


0 


0 


0 


N14 


0 


0 


0 


0 


8 


0 


0 


0 


0 


0 


N17 


0 


0 


0 


0 


0 


10 


0 


6 


0 


0 


N15 


0 


0 


0 


0 


0 


0 


12 


4 


0 


0 


Nil 


0 


0 


0 


0 


0 


0 


0 


0 


2 


1 



This product matrix yields a score of 247. The 
connection matrix that corresponds this product 



matrix having the lowest score is as follows. 
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W5 


Wl 


W2 


W6 


W7 


W3 


W8 


W4 


W9 


WIO 


N12 


1 


1 


0 


0 


0 


0 


0 


0 


1 


0 


N16 


0 


1 


1 


0 


0 


0 


0 


0 


0 


0 


N13 


0 


0 


1 


0 


1 


0 


0 


0 


0 


1 


N10 


1 


0 


0 


1 


1 


0 


1 


0 


0 


0 


N14 


0 


0 


0 


0 


1 


1 


0 


0 


0 


0 


N17 


0 


0 


0 


0 


0 


1 


0 


1 


0 


0 


N15 


0 


0 


0 


0 


0 


0 


1 


1 


0 


0 


Nil 


0 


0 


0 


0 


0 


0 


0 


0 


1 


1 



10 

These product and connection matrices yield the 

following sequence of genetic material in an 

optimized genome representation according to the 

present teachings. 
15 

W5 Wl W2 W6 W7 W3 W8 W4 W9 W10 



This linear sequence of weights provided by the 
optimized genome representation may be translated 

20 into a data structure representing the graph 

structure 200 in several ways. For example, the 
internal representation of the graph data structure 
may contain the appropriate index into the genome 
representation either directly or through a table 

25 indexed by an inherent link index. Whenever a weight 
is needed, this index is used to extract the correct 
number from the genome representation. In another 
example, a table may map the index in genome-order to 
the intrinsic index in the graph data structure. 

30 Before the graph structure is evaluated, this table 
is traversed and the weights from the genome are 
patched into the data structure. 
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In an alternative embodiment, the connection 
matrix is a square matrix whose rows and columns both 
represent arcs and whose elements hold a one if and 
only if the corresponding two arcs share a node in 
5 common. The corresponding weight matrix represents 
the true distance off the main diagonal. In this 
embodiment, the rows and columns are swapped at the 
same time, i.e. when swapping rows i and j columns i 
and j are also swapped, when finding the optimal 
10 score. 

A variety of hardware systems including general 
purpose computer systems and specialized systems may 
be employed to automatically design a desired 

15 structure by deriving a genome representation 

according to the present teachings. The present 
techniques decrease the computational time on the 
hardware system employed to automatically determine 
the genome representation and automatically evolve 

20 the weights. 

The foregoing detailed description of the 
present invention is provided for the purposes of 
illustration and is not intended to be exhaustive or 
25 to limit the invention to the precise embodiment 
disclosed. Accordingly, the scope of the present 
invention is defined by the appended claims. 
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